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Abstract 

As a first step towards a mathematically rigorous understanding of adaptive spectral//ip 
discretizations of elliptic boundary-value problems, we study the performance of adaptive 
Legendre-Galerkin methods in one space dimension. These methods offer unlimited approx- 
imation power only restricted by solution and data regularity. Our investigation is inspired 
by a similar study that we recently carried out for Fourier-Galerkin methods in a periodic 
box. We first consider an "ideal" algorithm, which we prove to be convergent at a fixed 
rate. Next we enhance its performance, consistently with the expected fast error decay of 
high-order methods, by activating a larger set of degrees of freedom at each iteration. We 
guarantee optimality (in the non-linear approximation sense) by incorporating a coarsening 
step. Optimality is measured in terms of certain sparsity classes of the Gevrey type, which 
describe a (sub-)exponential decay of the best approximation error. 
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1 Introduction 

The mathematical theory of adaptive algorithms for approximating the solution of multidimen- 
sional elliptic PDEs is rather recent. The first convergence results of adaptive finite element 
methods (AFEM) have been proved by Dorfler [12] and Morin, Nochetto, and Siebert [15]. On 
the other hand, the first convergence rates were derived by Cohen, Dahmen, and DeVore [7] for 
wavelets in any dimensions d, and for finite element methods (AFEM) by Binev, Dahmen, and 
DeVore [3] for d = 2 and Stevenson |19j for any d. The most comprehensive results for AFEM are 
those of Gascon, Kreuzer, Nochetto, and Siebert [6] for any d and data, and Cohen, DeVore, 
and Nochetto [8j for d = 2 and data. The key result of this theory is that AFEM delivers a 
convergence rate compatible with that of the approximation classes where the solution and data 
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belong. The recent results in [8| reveal that it is the approximation class of the solution that 
really matters. However, in all the above cases (practical wavelets and FEM) the convergence 
rates are limited by the approximation power of the method, which is finite and related to the 
polynomial degree of the basis functions or the number of their vanishing moments, as well as 
the regularity of the solution and data. We refer to the surveys pS] by Nochetto, Siebert and 
Veeser for AFEM and [20j by Stevenson for adaptive wavelets. 

A similar study for adaptive high-order methods (such as spectral element methods or hp 
finite element methods) has started only recently and very few results are available. The ultimate 
goal is to design algorithms which optimally choose between /i-refinement and p-enrichment, and 
for which a rigorous mathematical proof of rate of convergence and optimality can be given. This 
is a formidable task which requires, among other things, the study of sparsity classes tailored 
to functions which are (locally) infinitely smooth. Some rigorous mathematical results on the 
convergence of /ip-methods for PDEs have been established recently in [181 ITSl [4]. Although 
the numerical implementations of adaptive hp methods has now reached high levels of efficiency 
(see, e.g., [I?]), the theoretical study of optimality has never been addressed. A first step in 
this direction has been accomplished in [5], where the contraction and the optimal cardinality 
properties of adaptive Fourier-Galerkin methods in a periodic box in with dimension d > 1 
are presented together with the analysis of suitable nonlinear approximation classes (the classical 
one corresponding to algebraic decay of the Fourier coefficients and another one associated with 
exponential decay). 

The present paper represents a second step towards the study of optimality for high-order 
methods. We extend the method and the results contained in [5j to a non-periodic setting in 
one dimension. This is the closest situation to the periodic case, since an orthonormal basis 
is readily available (the so called Babuska-Shen basis formed by the anti-derivatives of the 
Legendre polynomials); together with the associated dual basis, it allows one to represent the 
norm of a function or a functional as a ^2-type norm of the vector of its coefficients. In addition, 
the stiffness matrix for smooth coefficients of the differential operator exhibits a quasi-sparsity 
behavior, i.e., an exponential decay of its entries as one goes away from the diagonal. 

In this paper we only consider the case of an exponential decay of the best approximation 
error of the solution of the PDE; indeed this is the most relevant situation which motivates the 
use of a spectral /p method. Our approach relies on a careful analysis of the relation between the 
sparsity class of a function and the sparsity class of its image through the differential operator. 
As already pointed out in the analysis of the Fourier method [5], the discrepancy between the 
sparsity classes of the residual and the exact solution suggests the introduction of a coarsening 
step that guarantees the optimality of the computed approximation at the end of each adaptive 
iteration. 

The multi-dimensional situation, which poses additional difficulties, is currently under in- 
vestigation. 

2 Legendre and Babuska-Shen bases 

Let / = (—1,1) and Lk{x), A; > 0, be the A:-th Legendre orthogonal polynomial in /, which 
satisfies degL^ = k, L].{1) = 1 and 




m > . 
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Furthermore, we denote by 

Mx) = VkTT/2 Lk{x) , k>0, 
the elements of the orthonormal Legendre basis in L'^{I), which satisfy 

4>k{x)(t>m{x)dx = 5km , m>0. 



I 



We denote by -D = d/dx the first derivative operator. The natural modal basis in Hq(I) is the 
Babuska-Shen basis (BS basis), whose elements are defined as 
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%(x) = Vk-1/2 J Lk-i{s) ds = -^^=^ {Lk-2{x) - Lk{x)) k>2; (2.1) 
they satisfies deg% = k and 

Dr,k = -A-i ■ (2.2) 

Thus, the r/^'s satisfy 

iVk,Tlm)H^{i) = j^Dr]kix)Dr]mix)dx = 6km , k,m>2, (2.3) 

i.e., they form an orthonormal basis for the ffQ(/)-inner product. 

Equivalently, the (semi-infinite) stiffness matrix Sr^ of the Babuska-Shen basis with respect 
to this inner product is the identity matrix /. On the other hand, one has 



(2fc-3)(2fe+i) ifm - k , 

- (2.+i)V(2Li)(2.+3) if"^ = fc + 2, for^>m, (2.4) 
elsewhere. 



which means that the mass matrix iVf^ is pentadiagonal. (Since even and odd modes are mu- 
tually orthogonal, the mass matrix could be equivalently represented by a couple of tridiagonal 
matrices, each one collecting the inner products of all modes with equal parity). For every 
V G Hq{I) we have 

oo 

v{x) = 

k=2 

with Vk = j\ Dv{x)Dr]k{x)dx . In view of the results in the next sections, we observe that (12. 2p 
yields 

oo oo 

Dv = ^^VkDr]k = -^^Vk(l)k-i ] (2.5) 

k=2 k=2 

comparing this expression with 

oo 

Dv = Y,{Dv)^.<Ph 

h=l 

yields 

iDv)^: = -Vh+i V/i > 1 . (2.6) 
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Prom ([231), there follows that the H^{I) -norm can be expressed, according to the Parseval 
identity, as 

oo 

'\v\\'iun='^\vk\'^ = v'^v , (2.7) 



k=2 



where the vector v = (vk) collects the coefficients of v. The L (/)-norm of v is given by 

\\v\\l2^j^=V^Mr,V . (2.8) 

Correspondingly, any element / G H^^{I) can be expanded in terms of the dual Babuska-Shen 
basis, whose elements r/^, k >2, are defined by the conditions 

{Vlv)=vk Vz;Gi7(l(/) ; 

precisely one has 

oo 

/ = X] f'^'^k ' with fk = if, , 
k=2 

and its F"^/) -norm can be expressed, according to the Parseval identity, as 



H-M/) = ElAl'- (2-9) 

k=2 

Summarizing, we see that the one-dimensional Legendre case is similar, from the point of 
view of expansions and norm representations, to the Pourier case (see [5J). 

Throughout the paper, we will use the notation || . || to indicate both the //Q(/)-norm of a 
function v, or the H~^{I)-noTm of a linear form /; the specific meaning will be clear from the 
context. 

Moreover, given any finite index set AcN2:={A;GN: A;>2}, we define the subspace of 

Va := span{7/fc \ k e A} ; 

we set |A| = card A, so that dimVA = |A|. If g admits an expansion g = YlT=29kVk (converging 
in an appropriate norm), then we define its projection P\g onto V\ by setting 

P^g = X Qk-rik ■ 
keA 

3 The model problem and its Galerkin discretization 

We now consider the elliptic problem 

\Lu = -D-{vDu)+au = f in/, 
\n(-l)=n(l)=0, 

where v and a are sufficiently smooth real coefficients satisfying < z^^= < v{x) < v* < oo and 
< (t{x) < (7* < oo in /; let us set 

a* = and a* = max(z^*,(T*) . 
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We formulate this problem variationally as 

ueH^il) : a{u,v) = {f,v) e H^{I) , (3.2) 

where a{u,v) = j^uDuDv + JjUuv. We denote by It;! = a{v, v) the energy norm of any 
V G Hq{I), which satisfies 

-v/a*ll^ll ^ llblll ^ II • (3.3) 
Given any finite set A C N2, the Galerkin approximation is defined as 

ua^Va : a{uA,VA) = {f,VA) Vtja G Va • (3.4) 
For any it; G Va, we define the residual 

r{w) = f - Lw = '^rk{w)r]l , where fk{w) = {f-Lw,r]k) = {f,r]k)-a{w,r]k). 
k 

Then, the previous definition of ua is equivalent to the condition 

PAr{uA) = , i.e., rk{uA) = VA; G A . (3.5) 

On the other hand, by the continuity and coercivity of the bilinear form a, one has 

1 1 

— \\r{uA)\\ < II^^-^^aII < — \V{ua)\\ , (3.6) 
a* a* 

or, equivalently, 

1 1 

'k(iiA)|| < |||n-UA||| < ^=||r(uA)|| . (3.7) 



a* ^/a 

3.1 Algebraic representation and properties of the stiffness matrix 

Let us identify the solution u = UkVk of Problem (j3.2p with the vector u = (ufc) of its 
Babuska-Shen (BS) coefficients. Similarly, let us identify the right-hand side / with the vector 
f = (fi) of its BS coefficients. Finally, let us introduce the semi-infinite, symmetric and positive- 
definite matrix 

A = (ae^k) with ae,k = a{r]k,r]e)k,e>2 ■ (3.8) 
Then, Problem (j3.2p can be equivalently written as 



Au = f . (3.9) 
In order to study the properties of the matrix A, let us first recall the following result. 
Property 3.1 (product of Legendre polynomials). There holds 

min(m,n) 

Lm{x)Ln{x) = ^ A!^^nLm+n-2r{x) (3.10) 
r=0 

with 

^ Am-r^rAn-r 2n + 2m — 4r + 1 

'~ An+m-r 2n + 2m - 2r + 1 

and 

_ l-3-5...(2m-l) _ (2m)! 
° ' ml ~ 2™(m!)2 ' 
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Proof. See e.g. \T\. 

Bearing in mind (13. 8|) . in the sequel we will make use of the following notation 

with ^ ^ 

flm^n := j v{x)Dr]rn{x)Dr]n{x)dx and a^|„ := j a{x)r]raix)r]n{x)dx. 

If 2^(2;) = YlT=o ^kLk, then using (I3.10p it is possible to prove that 

j v{x)Drim+i{x)Drin+i{x)dx 
V(2m + l)(2n + l) /"i 



u{x)Lm{x)Ln{x)dx 
min(m,n) 

Bm,n^m+n-2r , (3-11) 

r=0 



where we set 



._ V(2m + l)(2n + l) 
'"''^ ■ 2m + 2n - 4r + 1 '"'^ ' ^ ' 



Property 3.2 (coefficients B^^). There holds 



B"" < 1 

m,n 

for every m,n > and < r < min(m, n). 

Proof. We first observe that Stirling's formula m! ~ \/27rme~'"m'" implies 

2™ 

^„ ~ ^= . (3.13) 
This can be used to prove asymptotic estimates for the factor ^"^-r^rA„-r ^£ j[r^^^- 



Case < r < min(m, n): 



A„+m-r 71" \/m - r^/n - r^/r 

Case r = 0: 



Case r = min(m, n) and m ^ n: 



^mm{m,n)^\m-n\ 1 y^max(m,n) 



^max(m,n) Y^min(m, n)^^\m — n\ 

When m = n it is sufficient to use j4o = 1 to get ^^^^ = 1. 



Combining these results with (j3.12p . we now estimate the quantities -B^n ^ follows: 
(a) Case < r < min(m, n): 

1 ^(2m + l)(2n + l) ^/n + m-r 



vr 2m + 2n-2r + 1 ^{jn - r){n - r)r 
1 \/mn 1 



vr yjm + n — r \Jm — r\/n — r^/r 
We note that the above last term can be asymptotically bounded by 

1 1 



5^ ~ 



71" ^y min(m + n,\m — n\) 
which amounts to considering the extremal cases r = 1 and r = min(m, n) — 1. 
(b) Case r = 0: 



(c) Case r = min(?n,, n) and m ^ n: 

nmin(m,n) ^ ^ 

m,n I — n r ■ 

\J\n — m\ 

When m = n we obtain -B^m ~ 1- 

Hence, by combining (a)-(c), there follows that the terms -B^n' < r < min(m, n) are asymp- 
totically bounded by a constant independent of m,n,r. □ 

Proposition 3.1 (decay of aln,n)- If there exists r] > and a positive constant Crj only depending 
on r] such that 

Wk\ < c^e-'i'' yk>0 , 

then there holds 

laWj < Ce-"!'^-"^! Vn,m>2, (3.14) 
where C is a constant only depending on r]. 
Proof. Using (jS.lip and Property 13.21 there follows 



min(m,,n) min(m,n) 

Wm+n-2r\ < ^ 
r=0 r=0 
min(m,n) 

^ —r]\m—n\ \ ^ — 2»?(min(m,n)— r) ^ ^r]\m,—n\ 
r=0 



This gives p.l4p as asserted. 
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Let a{x) = Yl'kLo '^kLk then using (j2.ip and (j3.10p it is possible to prove that 



a. 



'^m]n '■= I (T{x)'l]m{x)r]n{x)dx 

1 

a{x){Lrn-2ix) - (x)) (L„_2 (x) - Ln{x))dx 

1 



1 



V(2m- l)(2n- 1) 
1 

A/(2m- l)(2n- 1) 



min(m— 2,n— 2) 

C'm-2,n-2'^m-2+n-2-2r 

r=0 



where 



min(ni— 2,n) min(m,n— 2) 

+ X/ C'^-2,n<^ni-2+n-2r + ^ C'^,n-2<^ni+n-2-2r 
r=0 r=0 
min(r?i,ra) | 

} , (3.15) 

r=0 I 



2m + 2n - 4r + 1 
Property 3.3 (coefficients CJ^^^). There holds 

1 



C I, • < 1 



Y/(2m- l)(2n- 1) 
wzi/i j = 0,2 and m,n > and < r < min(m — k,n — j). 



Proof. Let us first consider the case k,j = 0. Proceeding as in the proof of Property 13.21 we get 
the following cases for the au 

(a) Case < r < min(m, n): 



the following cases for the auxiliary quantity DL, „ = , ^ C. 

^ J- M J m,n ^(2m,-l)(2n-l) ' 



rri'n ' 



1 1 1 \Jn + m — r 



vr y/(2m - l)(2n - 1) 2m + 2n - 2r + 1 y^(m - r)(n - r)r 

(b) Case r = 0: 

" 7^ ^(2m - l)(2n - 1) 2m + 2n + 1 ' 

(c) Case r = min(m, n): 



/mn 



11 1 max(n, m) 



m,n 



Y/(2m - l)(2n - 1) 2 max(m, n) + 1 Y/min(m, n)y^|m - n\ 



We note that each right-hand side term in (a)-(c) is asymptotically bounded by a constant. 
Therefore the quantities D^^, < r < min(m,n) are asymptotically bounded by a constant 
independent of n, m, r. The other terms in (jS.lSp can be treated similarly. □ 
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Proposition 3.2 (decay of ami'n)- If there exists r] > and a positive constant Cjj only depending 
on rj such that 

Wk\ < Crjc-'^'' yk>o , 

then there holds 

\a^^]n\ < Ce-^l"-"^l Vn, m > , (3.16) 
where C is a constant only depending on ij. 

Proof. Use ()3.15p together with Property 13.31 and argue as in the proof of Proposition 13.11 □ 

Combining Propositions 13.11 and 13.21 yields 

Corollary 3.1 (decay of am,n)- If there exists rj > and a positive constant Cr^ only depending 
on rj such that 

kfcl, kfcl < C^e'''^ V/c > , 

then there holds 

\am,n\ < Ce-^l"-"*! Vn,m > 2 , (3.17) 

where C is a constant only depending on r}. 

Correspondingly, the matrix A belongs to the following class. 

Definition 3.1 (regularity classes for A). A matrix A is said to belong to the exponential class 
T^eivi) if there exists a constant cl > such that its elements satisfy 

Iflm.nl < CLe"''^!™-'"! m,n>2. (3.18) 

The following properties hold. 

Property 3.4 (continuity of A). If A £ T)(,{r]L), then A defines a bounded operator on ^^(N2). 

Proof. It is sufhcient to extend the semi- infinite matrix A = {ae^k)i,ken2 to a bi-infinite matrix 
A = ia£^k)i,kez such that it corresponds to the identity matrix for € Z \ N2 and it is equal 
to A otherwise. Then proceed as in [HI [10]. □ 

Property 3.5 (inverse of A). If A £ T>e{rjL) and there exists a constant cl satisfying (13.18P 
such that 

CL < -{e'^'' -l)minae,e, (3.19) 

then A is invertible in £^(N2) and A~^ € Ve{fjL) where f]L S (0,r/i] is such that z = e~^^ is the 
unique zero in the interval (0,1) of the polynomial 

e2vL + 2CL + 1 , ^ 

Z -. + 1 . 

e-nL{cL + l) 

Proof. See [5i Property 2.3]. □ 

For any integer J > 0, let Aj denote the following symmetric truncation of the matrix A 

^ if\£-k\<J, 
[Aj)e,k - < . , (3.20) 

[0 elsewhere. 

Then, we have the following well-known results. 
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Property 3.6 (truncation). The truncated matrix Aj has a number of non-vanishing entries 
bounded by 2 J. Moreover, under the assumption of Property \3.4[ there exists a constant Ca 
such that 

\\A-Aj\\<iPa{J,v) :=CAe-^^^ 
for all J > 0. Consequently, under the assumptions of Property \3.5\ one has 

||A-i-(A-i)j|| < Va-i(^,??l) (3.21) 
where we let f]L be defined in Propertu \3.5[ 

Proof. See [5, Property 2.4]. □ 



4 Towards an adaptive algorithm 

In order to design an adaptive algorithm with optimal convergence and complexity properties, 
we start by considering an ideal one. This will serve as a reference to discuss the most relevant 
aspects which have to be taken into account in designing the final algorithm. The ideal algorithm 
uses as error estimator the ideal one, i.e., the norm of the residual in H^^{I). We thus set, for 
any v G Hq{I), 

^^{v) = \\r{v)f= \rk{v)\\ (4.1) 

SO that (|3.6p can be rephrased as 

1 1 

—r]{uA) < \\u - ua\\ < — r]{uA) . (4.2) 
a* a* 

Obviously, this estimator is hardly computable in practice. However, by introducing suitable 
polynomial approximations of the coefficients and the right-hand side, it is possible to consider 
a feasible version. In the sequel we will not pursue this possibility, but we refer to [5j for the 
details. Given any subset A C N2, we also define the quantity 

r^\v;A) = \\PMv)f = Y,Mv)\\ 

fcGA 

so that ri{v) = r/(t>;N2). 

We now introduce the following procedures, which will enter the definition of all our adaptive 
algorithms. 

• UA := GAL (A) 

Given a finite subset A C N2 , the output ua G Va is the solution of the Galerkin problem 
([33]) relative to A. 

• r := RES(?;a) 

Given a function va & Va for some finite index set A, the output r is the residual r{vA) = 
f - LvA- 
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A* := DORFLER(r, 9) 

Given 6 G (0, 1) and an element r G H^^{I), the ouput A* C N2 is a finite set of minimal 
cardinality such that the inequality 

||PA.r|| > e||r|| (4.3) 

is satisfied. 



Note that the latter inequality is equivalent to 

||r -PA*r|| < a/I - 6l2||r|| . (4.4) 

If r = r{u\) is the residual of a Galerkin solution u\ £ Va, then by (j3.5p we can trivially assume 
that A* is contained in A"^ := N2 \ A. For such a residual, inequality (j4.3p can then be stated as 

v{uA;A*)>e7]{uA) (4.5) 

Dorfler marking (or bulk chasing). Writing = rkiuh), the condition (|4.5p can be equiva- 
lently stated as 

^ \h\'>0'Y.\^k\'. (4.6) 

fceA* fc^A 

Thus, the output set A* of minimal cardinality can be immediately determined if the coefficients 
ffc are rearranged in non-increasing order of modulus. The cardinality of A* depends on the rate 
of decay of the rearranged coefficients, i.e., on the sparsity of the representation of the residual 
in the chosen basis. 

Given two parameters 6 £ (0, 1) and tol £ [0, 1), we are ready to define our ideal adaptive 
algorithm. 

Algorithm ADLEG(6', tol) 
Set ro := /, Aq := 0, n = -1 
do 

n n + 1 

dAn := D6RFLER(r„,6') 

A„+i := A„ U dAn 
Un+i := GAL(A„+i) 
r„+i := RES(ti„+i) 

while ||?'n+i|| > tol 

This algorithm is the non-periodic counterpart of the ideal algorithm ADFOUR considered 
in [S]. The same proof given therein yields the following result, which states the convergence of 
the algorithm with a guaranteed error reduction rate. 
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Theorem 4.1 (contraction property of ADLEG). Let us set 

P = p{e) = ^l-^9^G{0,1). (4.7) 

Let {An, u„}n>o be the sequence generated by the adaptive algorithm ADLEG. Then, the fol- 
lowing bound holds for any n: 

|||n - n„+i||| < pju - Un\l . 

Thus, for any tol > the algorithm terminates in a finite number of iterations, whereas for 
tol = the sequence Un converges to u in H^{L) as n —)■ oo. □ 

At this point, some comments are in order. 

• The predicted error reduction rate p = p{6), being bounded from below by the quantity 
^y^^-^^i looks overly pessimistic, particularly in the case of smooth (analytic) solutions. 
Indeed, in this case a spectral (Legendre) Galerkin method is expected to exhibit very fast 
(exponential) error decay. For this reason, we will introduce in Section 17.31 a variant of 
the Dorfler procedure which - through a suitable enrichment of the set of new degrees of 
freedom detected by the usual Dorfler - will guarantee an arbitrarily large error reduction 
per iteration. 

• The complexity analysis of the algorithm requires to relate the current error Sn ■= — tin III 
to the cardinality of the set A„ of the activated degrees of freedom, having as a benchmark 
the best approximation (i.e. the one achieved with the minimal number of degrees of 
freedom) of the exact solution u up to an error given exactly by e„. This requires to 
investigate the sparsity class of the solution u, a task that will be accomplished in Section 
O we will confine ourselves to the case of infinite differentiable functions (including analytic 
ones) for which a natural framework is provided by Gevrey spaces. (The analysis of the 
case of finite smoothness can be carried out by extending the arguments presented in [5].) 

• The cardinality of the set A„ of active degrees of freedom selected by the Dorfler procedure 
may be estimated in terms of the sparsity class of the residual, rather than the one of the 
solution. If the residual is less sparse than the solution, we run into a potential situation 
of non-optimality. This is precisely what happens for the Gevrey-type sparsity classes, as 
pointed out in Section [6l For this reason, we will incorporate in our algorithm a coarsening 
step, introduced in Section [721 to bring the cardinality of the active degrees of freedom at 
the end of each iteration to be comparable with the optimal one dictated by the sparsity 
class of the solution. 

5 Nonlinear approximation in Gevrey spaces 

At first, we consider Gevrey spaces of linear type and then, through the concept of nonlinear 
approximation, we will introduce sparsity classes of functions related to Gevrey spaces. 

5.1 Gevrey classes and their properties 

We recall the following definition of classical Gevrey space. 
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Definition 5.1 (definition of G^{I)). For any t G (0,1], we denote by G^{I) the space of C°° 
functions v in a neighborhood of I for which there exist a constant L >0 such that for any n > 

\\D^vU2(^j) < L"+\n\)'/' . (5.1) 

The choice t = 1 yields the usual class £/ (I) = (I) of analytic functions in a neighborhood 
of /. 

Another family of spaces of Gevrey type has been introduced in [2] by relaxing the require- 
ment on the growth of the derivatives near the boundary. In our simple one-dimensional setting, 
its definition is a follows. 

Definition 5.2 (definition of Let £ denote the Legendre operator in I 

Cv = -D{{1 - x^)D)v ; 

and, for any k > 0, let us set 



RkV 



£Pv ifk = 2p, 

{l-x'^)DCPv ifk = 2p + l 



Then for any t £ (0, 1], we denote by A*{I) the space of C°° functions v in a neighborhood of I 
for which there exist a constant L > such that for any k >0 

WRkvU^^j^KL^+Hkl)^/' . (5.2) 

The relation between the two families is given by the inclusion 

G\I) C A\I) C G^{I) , (5.3) 

where the first inclusion is an immediate consequence of the fact that 1 — is bounded with 
its derivatives in /, whereas the second one is proved in [2]. 

Let Fn = {4>o, • • • jfpn} be the space of polynomials of degree < n in I and let d2{v,Fn) = 
infpgp^ \\v — p\\l^[i) be the distance of a function v G L'^{I) to P„,. 

Proposition 5.1. Let v G L^{I). The following conditions are equivalent: 

(i) V G £^\L). 

(a) There exist Li > and r]i > such that 

d2(v,P„) < Lie-^''!"' Vn>0. (5.4) 

(Hi) There exist L2 > and 772 > such that the Legendre coefficients of v = '^^=QVk4'k satisfy 

\vk\ < L2e"''2fc* vA: > 0. (5.5) 

Proof. The equivalence (i) o (ii) is proved in [2, Thm. 7.2]. In order to prove the equivalence 
(ii) o {Hi), we observe that 

/ \ 1/2 

d2{v,Fn) = \\v -^Vk(t>k\\L^(I) = {^\vk\'^ j ■ (5.6) 
fc=0 \k>n / 
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Thus, if (ii) holds, we have 

\vn+i\ < d2{v,Fn) < Lie-^^'^'. (5.7) 
Now, recalHng that (a + 6)* < + for a,b >0 and t G (0, 1], we have 

g-r;in' _ gf^i ^-771 (n'+l) _ (n* + l') <; g-»?l ("+!)* (^5^3) 

which yields (iii) with L2 = e^^Li and r/2 = r/i- Conversely, let us assume that (iii) holds. 
Then, using again (|5.4|) and setting for simplicity = 772, we have 

/+00 
e-^^y'dy . (5.9) 

Now, setting z = and s = 1/t > 1, we have 

/+00 r+oo 

(5.10) 

in* Jo 

The last integral can be bounded by a polynomial of degree [s — 1] . It follows that for any 
r/i < r/ = T/2 we can find Cy^^^s > such that 

thus condition (ii) is satisfied. □ 

Remark 5.1. As a consequence of the previous proposition and the inclusion G*{I) Q j2/*(/), 
any Gevrey function v G G^{I) admits Legendre coefficients Vk which decay according to the law 
(1531). 



For analytic functions, the results in [ii) and {in) of Proposition 15.11 can be made more 
precise as follows. 

Proposition 5.2. Let v G L'^{I) be analytic in the closed ellipse £r D [—1,1] in the complex 
plane C with foci in ±1 and semiaxes' sum equal to r > 1. Then, one has 

\vk\ < C{r)^/2{2k + l)e-'^'' VA: > , (5.11) 

and 

'i2(t',Fn) < (5(r)(n + l)^/2g-r?n Vn > (5.12) 

for constants C{r) and C{r) only depending on r and r] = logr. 

Proof. The estimate (j5.1ip is a consequence of the bound 

\vk\ <C{r){2k + l)e-'^'' k>0 

given in pT, Theorem 12.4.7], where = {k+l/2) J^^ v{x)Lk{x)dx and satisfies = ^Jk + l/2vk- 
On the other hand, 

d2{v,¥nf = <2C2(r)^(2fc + l)e-2'?^ 

k>n k>n 

f+00 C'^(r) 

< 2C\r) / (2y + 3)6-2^^2/ < 6— ^(n + 1)6-^''", 
Jn V 

and ^J2h follows. □ 
There follows that conditions (ii) and (Hi) are fulfilled with t = 1 and any ??i,r/2 < logr. 
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5.2 Nonlinear approximation and sparsity classes 

From now on, we represent any v G V according to the BS basis {?7fc}^2 &s v = ^-'kVki^)- 
We recall that given any nonempty finite index set A C N2 and the corresponding subspace 
Va. C F of dimension | A| = card A, the best approximation of w in is the orthogonal projection 
of V upon V\, i.e. the function P\v = YlkeA'^k'Hkj which satisfies 

x 1/2 

\v - Pav\\ = I ^ 

(we set P\v = if A = 0). For any integer A?^ > 1, we minimize this error over all possible 
choices of A with cardinality N, thereby leading to the best N-term approximation error 

En{v) = inf \\v — Pav\\ ■ 
ACN2, |A|=iV 

A way to construct a best N-term approximation vn of v consists of rearranging the coefficients 
of V in decreasing order of modulus 

\vkA > ... > \vkj > \vk„+A > ■■■ 

and setting vn = P^f^v with Ajv = {kn '■ ^ < n < N}. Let us denote from now on = Vk„ 
the rearranged BS coefficients of v. Then, 



en{v) = \\v - pajv'^ii = 



\v. 

n>N 



* |2 I 



We will be interested in classifying functions according to the decay law of their best N-term 
approximations, as N ^ 00. To this end, given a strictly decreasing function 4> :N ^ R+ such 
that (p{0) = (po for some (j)Q > and (f){N) — > when N — >■ 00, we introduce the corresponding 
sparsity class by setting 

A^ = {veV : ||^;|U, := sup < +oo| . (5.13) 

The quantity (which need not be a (quasi-)norm, since Acj) need not be a linear space) 

dictates the minimal number N^ of basis functions needed to approximate v with accuracy e. In 
fact, from the relations 

EnAv) < e < En,^i{v) < (j>{N, - 1)\\v\\a^ , 
and the monotonicity of (j), we obtain 



Hereafter, we focus on sparsity classes identified by exponentially decaying (f) of the type 
(1){N) = e~^^ for real numbers tj > and < t < 1. The following argument clarifies their 
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relationship with certain famihes of Gevrey spaces. Let us introduce the following spaces of 
Gevrey type: given real numbers 77 > and < i < 1, we set 

00 

G^'\I) = {veV: \\v\\G,r,,t := 5]e2'''=>fcp < +00} . (5.15) 

k=2 

We immediately observe that v G G^'^^I) implies 

Using (j2.6p . we deduce from Proposition 15.11 that the first derivative of a function v G G'^'*{I) 

belongs to the space defined in Definition 15. 2t in particular, by using (|5.3p . v' is a Gevrey 

t - 

function in G^-' (/). 

Functions in G^'^{I) can be approximated by the linear orthogonal projection on ¥^^^{I) := 

M 

Pmv = ^ Vkilk , 

k=2 

for which we have 

11^ - P,,vf = Y.\^''\'=I1 e-2^^*e2^^>,p 

k>M k>M 

< e-2'^*^' Yl e'^'l^fcl' < e~^''^''\\v\\h,rj,t ■ 

k>M 

Setting N := dim(P5j) = M - 1, we have 

En{v) < \\v - Pn+iv\\ < e-'^(^+i)' \\v\\G,r,,t < e-^""' \\v\\G,^^t ■ (5.16) 
Hence, we are led to introduce the function 

cj)[N) = e""^' {N>1) , (5.17) 

whose inverse is given by 

r^(A) = ^(logi)'^* (A<1), (5.18) 

and to consider the corresponding class A^f, defined in (j5.13p . which contains G'^'*(/). 
Definition 5.3 (exponential class of functions). We denote by Aq^ the set defined as 

^^*:= \veV : \\v\\,^^t := sup EN{v)e''^' < +00} . 

The class Aq^ can be equivalently characterized in terms of behavior of rearranged sequences 
of BS coefficients. 

Definition 5.4 (exponential class of sequences). Let £^*(N2) be the subset of sequences v G 
£2(N2) so that 

ll^ll<?2;'*{N2) (^n(^"*)/2exp (r?n*) \v^\^ < +00 , 

where v* = (v^)^^-^ is the non-increasing rearrangement of v. 
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The relationship between A!q and £^*(N2) is stated in the following Proposition, whose proof 
is a straightforward adaptation of the one given in [5, Proposition 4.2]. 



Proposition 5.3 (equivalence of exponential classes). Given a function v and the sequence 

,ri,t 



= {vk)k&i.2 of its BS coefficients, one has v £ A'q if and only if v £ £^^(f^2), with 



As shown in [5] the set £^*(N2) is not a vector space, since it may happen that u, v belong 
to this set, whereas u + v does not. On the other hand, we have the following property (see O 
Lemma 4.1]). 

Property 5.1 (quasi-triangle inequality). // Uj € ^JS'*(N2) for i = 1,2, then ui + U2 G £^^{^2) 
with 

_i -i -1 

||ui+U2||^.,t(P^^) < ||ui||^,i,t(^^^ + ||u2||^.2,t(j^^), r] t =r]^^ +r]^K 

6 Sparsity classes of the residual and the solution 

This section is devoted to the study of the sparsity class of the residual r = r{u\) produced by 
any Galerkin solution ua G Va, and its connection with the sparsity class of the exact solution 
u. Indeed, the step 

dA := D6RFLER(r, 6) 



selects a set dA of minimal cardinality in for which ||r — P^a'^II ^ \/l — ^^||?"||- Thus, if r 
belongs to a certain sparsity class identified by a function (p according to (|5.13p . then (j5.14p 
yields 

\dA\ < ( y^I^lM] + 1 . (6.1) 



Specifically, if r £ Aq for some rj > and t > 0, we have by (|5.18p 

i/t 



\r\\ ,fj,t 



|9A|<-i^(log^=^^) +1. (6.2) 



We begin by investigating the sparsity class of the image Lv, when the function v belongs 
to the sparsity class Aq* . It turns out that the sparsity classes of v and Lv are equivalent, in 
view of Proposition 15. 3^ to the sparsity classes of the related vectors v and Av, where A is the 
stiffness matrix (13.81). 



Proposition 6.1 (continuity of L in Aq^). Let the differential operator L be such that the 
corresponding stiffness matrix satisfies A G T)f>{r]L) for some constant rjL > 0. Assume that 
V G Aff for some 7] > and t £ (0, 1]. Let one of the two following set of conditions be satisfied. 

(a) If the matrix A is banded with 2p + 1 non-zero diagonals, let us set 

- V 

7] = r-r , t = t . 

' (2p+l)*' 
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(b) If the matrix A is dense, but the coefficients ijl and ry satisfy the inequality rj < rj^, let us 
set ^ 

fj = C{t)i] , i 



l + t 

where we define 



m ■■= (^y"-' VO<t<l. (6.3) 



Then, one has Lv G A^J" , with 



G 



Proof. The proof is an adaptation of a similar result in the periodic case given in [SI Proposition 
5.3]; we report here the details for completeness. Let Aj be the truncation of the stiffness matrix 
defined in ()3.20p : thus, by Property 13.61 we have II A - Aj II < Cac"''^-^, J > 0. On the other 
hand, for any j > 1, let Vj = Pj(v) be a best j-term approximation of v (with vq = 0), which 
therefore satisfies ||v — Vj|| < e"''-'' ||v||^r,,t^j^^y Note that the difference Vj — Vj_i consists of a 
single component and satisfies as well 



-vj-ill < e "J ||v||^M(p^^) . 

Finally, let us introduce the function x : N — t- N defined as x(i) = [j*! ? the smallest integer 
larger than or equal to j*. 

For any J > 1, let wj be the approximation of Av defined as 

J 

WJ = J^A^(j_j)(vj- - vj-_i) . 
i=i 



Writing v = v - vj + X]/=i(vj - Vj_i), we obt 



j=l\ - J ■ j-^n 

J 

Av - wj = A(v - vj) + ^(A - A^(j_j))(vj - Vj_i) . 

i=i 

We now assume to be in Case (b). Since A : £^(N2) — )• ^^(N2) is continuous, the last equation 
yields 

||Av-w,|| < ||e-^^*+^e-(''-r(^-^-)'l+''^-')j ||v||,...(^^). (6.5) 
The exponents of the addends can be bounded from below as follows because t < 1 

VL\{J-jY]+vj' = VL\{J-jY]-v{J-jY + v{{J-jY+j') 

> Vl{J - jf - V{J - jY + r]{{J - j) + jf 
= P{J-3)' + nJ\ 

with /3 = ?7i — ?7 > by assumption. Then, ()6.5p yields 

l|Av-w,|| < fl + X:e-^^-Me-''^'||v||,..(^^) < e"^^* ||v||,..(^^) . (6.6) 
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On the other hand, by construction wj belongs to a finite dimensional space Vaj, where 
I AjI < 2 ^ x( J - i) = 2 r/l ~ YT'/''' as J ^ oo . (6.7) 



This implies 



|Av-wj|| < e ''^^^■'^^M^^/{n^) 



G 



with t = and 77 = (^-^) ?? = C(0^ ^ asserted. 

We last consider Case (a). Since A^(j_j) = A if x{J ~ j) ^ for x{J ~ j) ^ whence 
j > J — p^/* , the summation in (|6.5p can be limited to those j satisfying jp < j < J, where 
jp = \J — p^/*] . Therefore 



|Av-wj|| < {e-^^' + Y^ 



|v|le-'{N2) 



j=jp 



Now, J — j < p^^^ if jp < j < J and > > ( J — p^/*)* > — p, whence 

||Av-w,|| < (l + e^He-^^'llvll^..^^^). 

We conclude by observing that |Aj| < {2p + 1) J, since any matrix Aj has at most 2p + 1 
diagonals. □ 

Before studying the sparsity class of the residual, it is convenient to rewrite the Galerkin 
problem (|3.4p in an equivalent (infinite-dimensional) way. To this end, let ua G M)^^ be the 
vector collecting the coefficients of ua indexed in A; let Fa G M'^' be the analogous restriction for 
the vector of the coefficients of /. Finally, denote by Ra the matrix that restricts a semi-infinite 
vector to the portion indexed in A, so that Ea = Ra corresponding extension matrix. 

Then, setting 

Aa = RaAR^ , (6.8) 
we preliminary observe that problem (j3.4p can be equivalently written as 

Aaua = fA . (6.9) 
Next, let Pa : ^^(N2) — ?■ ^^(N2) be the projector operator defined as 



(Pav); 



vx if A G A , 
if A ^ A . 



Note that Pa can be represented as a diagonal semi-infinite matrix whose diagonal elements 
are 1 for indexes belonging to A, and zero otherwise. Let us set Qa = I — Pa and introduce 
the semi-infinite matrix Aa := PaAPa + Qa which is equal to Aa for indexes in A and to 
the identity matrix, otherwise. The definitions of the projectors Pa and Qa easily yield the 
following result. 

Property 6.1 (invertibility of Aa). //A is invertible with A G T)(,{r]L), then the same holds 
for Aa. □ 
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Finally, the infinite dimensional version of the Galerkin problem (|3.4p reads as follows: find 
u G £'^{'N2) such that 

Aau = PAf . (6.10) 

Let Ea = : rI^I — > ^^(N2) be the extension operator and let ua G mI"^! be the Galerkin 
solution to (I6.9P : then, it is easy to check that u = Eaua- 
We are now ready to state the main result of this section. 

Proposition 6.2 (sparsity class of the residual). Let A G T)(.{7]l) and G ^(.{fii), for 

constants ?/l > and fj^ G (0, 77^] according to Property \3.5i If u G Aq^ for some r] > and 
t G (0, 1], such that rj < fji, then there exist suitable positive constants fj < r] and t <t such that 
r{u^) G JJq for any index set K, with 

\Hu\)\\^n,t < \\u\\^v,t . 

Proof. This is an adaptation of a similar proof in the periodic case given in [5, Proposition 5.4]; 
we report here the details for completeness. Assume for the moment we are given f] and t. By 
using Proposition 16. H i.e. ||Av|| ,-y,t < ||v|| f^^.ti, and Propertv 15.11 we get 



' < ~ "11 (6.11) 

- ll"ll4*^''i-*i(N2) ^ ll"'^ll4'^''i'*i{N2)' 



where ti and r/i are defined by the relations 



l + ti 



<ti, ri = C{ti)rii . 



From (j6.10p we have ua = (Aa) ^(PAf)- Using Propertv 16.11 and applying Proposition 16.11 to 
(Aa)"^ we get 

l|UA 11^2*1,,,*,^^^^ = M^2H,„^,^^^^ = ||(AA)-^(PAf)||^2jl,,.,,^^^^ < ||PAf||,g„*2(p,^) < l|f|l,^^2,*2(p^^) , 

with 

2*^r/i = c{t2)m <m , h = < t2 . 

i + t2 

By substituting the above inequality into (j6.1ip and using again Proposition 16.11 we get 

(6.12) 

where 



\''^\\ifm ^ ll^ll^^'^-i'^icN^) + ll^ll4''*^(N2) 



m = C{t)v <v , t2 = < * • 

This shows that the assertion holds true for the choice 

1 \ l+2t J. ( t t - t 



C[-r-irM[-r—Aat)i^, i 



2/ m + 2t/m + t/^' ' " i + 3t 



20 



It remains to verify the assumptions of Proposition 16.11 when A is dense. We note that there 
holds 

t t 

ti = <t2 = < t. 

l + 2t l + t 

Moreover, using rji < 2*^r]i < r]2 < r] and rji > fj^ > tj yields 

V < VL, m < r]L, m < f]L, 

which are the required conditions to apply Proposition 16.11 when A is dense. This concludes the 
proof. □ 



7 The predictor-corrector adaptive algorithm 

In this section we study a variant (named PC-ADLEG) of the ideal algorithm ADLEG in- 
troduced above. Motivated by the enlightening discussion at the end of Section d] and the 
subsequent results of Sections [5] and [6l we devise each iteration of the algorithm as formed by a 
predictor step followed by a corrector step. The predictor step guarantees an arbitrarily large 
error reduction (by suitably enriching the output set from the Dorfler procedure). This step 
is driven by the sparsity class of the residual and so, in view of Proposition 16.21 it does not 
guarantee optimality with respect to the sparsity class of the exact solution. The corrector step 
is realized by introducing a coarsening procedure which removes the smallest components of the 
output of the predictor step, in such a way to guarantee optimality. 

7.1 Enrichment 

We introduce the procedure ENRICH defined as follows: 

• A* := ENRICH(A, J) 

Given an integer J > and a finite set A C N2, the output is the set 

A* := {A; € N2 : there exists £ £ A such that \k - i\ < J} . 

Note that since the procedure adds a 1-dimensional ball of radius J around each point of A, the 
cardinality of the new set A* can be estimated as 

|A*|<2J|A|. (7.1) 

Recall now that TpAiJ^f]) = Cac"''^"^ from Property p.6p . Let > be defined as 



Ja := min < 



J G N : ^A-^{Je,f)L) = ^A-ie-^^-^" < ) ■ (7.2) 



A* := E-DORFLER(r, 61) 

Given 9 G (0, 1) and an element r G H~^{I), the ouput A* C N2 is defined by the sequence 

A :=D6RFLER(r,6') 

(7.3) 

A* :=ENRICH(A, Jg) . 
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7.2 Coarsening 

We introduce the new procedure COARSE defined as follows: 

• A := COARSE(w,e) 

Given a function w £ V\* for some finite index set A*, and an accuracy e > which is 
known to satisfy — < e, the output A C A* is a set of minimal cardinality such that 

\\w - Paw\\ < 2e . (7.4) 

The following result shows that the cardinality |A| is optimal relative to the sparsity class of 
u. We refer to Cohen O Theorem 4.9.1] and Stevenson [SDJ Proposition 3.2]. 

Theorem 7.1 (cardinality after coarsening). Let e > and let u G -^g^ ■ Then 

The approximation error obtained after a call of COARSE is estimated as follows. 
Property 7.1 (error after coarsening). The procedure COARSE guarantees the bounds 

\\u - Paw\\ < 3e (7.5) 

and, for the Galerkin solution G V\, 

l\u — ua\1 < 3\/a*e . (7.6) 

Proof. The first bound is trivial, the second one follows from the minimality property of the 
Galerkin solution in the energy norm and from (I3.3P : 



1^^ — < \lu — -Pa^^III < — Pa^W < 3\/a*e . □ 

7.3 PC-ADLEG: a predictor-corrector version of ADLEG 

Given two parameters 9 G (0, 1) and tol G [0, 1), we define the following adaptive algorithm. 
Algorithm PC-ADLEG(6', tol) 

Set ro := /, Aq := 0, n = -1 

do 

n n + 1 

dkn := E-D6RFLER(r„,6') 

A„+i := A„ U aA„ 
Un+i ■■= GAL(A„) 

A„+i := COARSE {un+i, ^^/^^||r„ 
Un+i ■■= GAL(A„+i) 
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rn+1 := RES(ti„+i) 
while llr^+ill > tol 

Theorem 7.2 (contraction property of PC-ADLEG). Let < ^ < 1 be chosen so that 

p = p(0) = 6— Vl-^2<i. (7.7) 

a* 

// the assumptions of Property \3.5\ are fulfilled, the sequence of errors u — Un generated for n > 
hy the algorithm satisfies the inequality 

lu - n„+i||| < pIu - Unl ■ 

Thus, for any tol > the algorithm terminates in a finite number of iterations, whereas for 
tol = the sequence Un converges to u in H^{I) as n —)• oo. 

Proof At the n-th step, we have dAn = E-D6RFLER(r„6') and A„+i = A„ U 9A„, where 

dAn = D6RFLER(r„,6l) 



(7.8) 

dAn = ENRICH(aA„, Je) . 



We recall that the set 9A„ is such that gn = Pq^ r^ satisfies 



(see (j4.4p l. Let Wn € V he the solution of Lwn = g-m which in general will have infinitely many 
components, and let us split it as 

Wn = Pt ,Wn + Ptc Wn =■■ Vn + Zn £ V-f, , © V-Tc . 

Then, by the minimality property of the Galerkin solution Un+i S ^ ^ in the energy norm, 
and by ()3.3p and ()3.7p . one has 

|||u - Sn+ll < ju - {Un + yn)\l < ju - Un - Wn + Zn\l 

< ^^\\L{u -Un- Wn)\\ + \/a*||z„,|| = ^=||r„ - gn\\ + Va*\\zn\\ . 



Thus, 

\u - n„+i||| < —;=\l (1 - 6*^) IknII + \/a*||2;„|| . 
Since Zn = (Ptc L~^P^ )r„, observing that 

k £ A^+i and i £ 9A„ \k - i\ > Jq 



we have 



iP^c L-'Pg^^ II < II A-^ - (A-^)jJI < iJA-^{Je,fjL) < 
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where we have used (|7.2p . Thus, we obtain 

2 



II M - Un+l\ 



< 



Vl-O^WrJ (7.9) 



/a* 

or, equivalently, 

\\u-un+i\\ < — Vi-e^lKW ■ (7.10) 

a* 

Since the right-hand side of ()7.10p is precisely the parameter e„, fed to the procedure COARSE, 
Property (jT.ip implies 



\\u - M„+i||| < 6^^ — VI - 02||r„|| < 6— VI - G'^lu - Un\ , (7.11) 
for the Galerkin solution Un+i € VAu+i ■ The assertion thus follows immediately. □ 

The rest of the paper will be devoted to investigating complexity issues for the sequence of 
approximations Un = ttA„ generated by PC-ADLEG. In particular, we wish to estimate the 
cardinality of each A„ and check whether its growth is "optimal" with respect to the sparsity 
class of the exact solution, in the sense that |A„| is comparable to the cardinality of the 
index set of the best approximation of u yielding the same error ||n — 

Theorem 7.3 (cardinality of PC-ADLEG). Suppose that u G , for some t] > and 
t £ (0,1]. Then, there exists a constant C > 1 such that the cardinality of the set A„ of the 
active degrees of freedom produced by PC-ADLEG satisfies the bound 

1 / iwwAv.t y/* 

|An|<^ log^ ^ + logC7 +1, Vn>0. 

v ' \ If - ""nil / 

If in addition, the assumptions of Proposition \6.S\ are satisfied, then the cardinality of the 
intermediate sets A„+i activated in the predictor step can be estimated as 

2 7 / ll^ll 4'''* \ 

|A„+i|<|A„| + -4 log^ ^^+logC +2Je, Vn>0, 

v ' \ If - ^^n+ill J 

where Jg is defined in (17. 2p and i) < r], i < t are the parameters which occur in the thesis of 
Proposition \6.Sl 



Proof. Combining Theorem 17. II with e„ = ^Vl — 0^||r„|| and Property l7.lt we get 

We now estimate |A„+i|. Recalling (I7.ip . we have |9A„| < 2Jg\dAn\, and using ()6.2p together 
with Proposition 16.11 and (j7.1ip yields 

(II II \ 1/* 

IfIU".' \ 
log- ^ + logC +1. 
\\u-Un+l\\ I 
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Finally, the second assertion follows from |A„+i| < |A„| + |5A„|. □ 

We observe that the cardinality of i.e., the set of degrees of freedom of the Galerkin 
solution at the end of each iteration, is optimal. On the other hand, in the case fj < r] and t < t, 
the cardinality of A^i+i may grow at a faster rate than the cardinality of A„. This is a direct 
consequence of the fact that, according to Proposition 16.21 the sparsity class of the residual may 
be worse than that of the solution. Moreover, the contraction constant in (|7.7p is as close to 
as desired provided 9 is close to 1, which is consistent with the expected fast error decay of 
spectral methods. This is a key feature of our contribution and a novel idea with respect to the 
standard algebraic case; see the surveys [T6| [20]. 

Similar results to Theorems 17.21 and 17.31 are valid also for the algebraic case, with optimal 
convergence rates only limited by solution regularity; they can be derived as in [5., Theorems 3.3 
and 7.2]. In addition, a more conservative version of PC-ADLEG, which avoids the procedure 
ENRICH, t can be studied as well following [5l Theorem 8.1]. This version exhibits better 
optimality properties than PC-ADLEG at the expense of a relatively large contraction factor, 
which is at odds with spectral accuracy. 
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